Interpretable Counterfactual Explanations Guided by Prototypes

Abstract

We propose a fast, model agnostic method for finding interpretable counterfactual explanations of classifier predictions by using class prototypes. We show that class prototypes, obtained using either an encoder or through class specific k-d trees, significantly speed up the search for counterfactual instances and result in more interpretable explanations. We quantitatively evaluate interpretability of the generated counterfactuals to illustrate the effectiveness of our method on an image and tabular dataset, respectively MNIST and Breast Cancer Wisconsin (Diagnostic). Additionally, we propose a principled approach to handle categorical variables and illustrate our method on the Adult (Census) dataset. Our method also eliminates the computational bottleneck that arises because of numerical gradient evaluation for black box models.
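As a rough illustration of how a class prototype could guide a counterfactual search, the sketch below picks, for an instance x, the nearest training point belonging to any class other than the predicted one, and scores a candidate counterfactual by its squared distance to that prototype. This is only one plausible reading of the abstract, not the authors' implementation: the function names, the single nearest neighbour, the squared Euclidean penalty, and the weight `theta` are all assumptions, and a linear scan stands in for the class specific k-d tree query.

```python
import math


def nearest_in_class(x, points):
    """Return the point in `points` closest to `x` (Euclidean distance).

    Assumption: a linear scan is used here for brevity; a class specific
    k-d tree would answer the same query faster on large datasets.
    """
    return min(points, key=lambda p: math.dist(x, p))


def find_prototype(x, predicted_class, data_by_class):
    """Pick a prototype among all classes except the predicted one.

    Returns (target_class, prototype, distance) for the closest candidate.
    Hypothetical helper: name and return shape are illustrative only.
    """
    best = None
    for label, points in data_by_class.items():
        if label == predicted_class:
            continue  # the counterfactual must move to a different class
        candidate = nearest_in_class(x, points)
        dist = math.dist(x, candidate)
        if best is None or dist < best[2]:
            best = (label, candidate, dist)
    return best


def prototype_loss(x_cf, prototype, theta=1.0):
    """Squared Euclidean distance from candidate to prototype, scaled by theta.

    Assumption: this term would be added to the other terms of a
    counterfactual objective to pull the search towards the prototype.
    """
    return theta * sum((a - b) ** 2 for a, b in zip(x_cf, prototype))
```

Given a dictionary that maps each class label to its training points, `find_prototype` returns the target class and prototype, and `prototype_loss` can then be evaluated for any candidate counterfactual during the search.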

Similar resources

Explanations of Counterfactual Inferences

When engaging in counterfactual thought, people must imagine changes to the actual state of the world. In this study, we investigated how people reason about counterfactual scenarios by asking participants to make counterfactual inferences about a series of causal devices (i.e., answer questions such as If component X had not operated [had failed], would components Y, Z, and W have operated?) a...

Causal Explanations in Counterfactual Reasoning

This paper explores the role of causal explanations in evaluating counterfactual conditionals. In reasoning about what would have been the case if A had been true, the localist injunction to hold constant all the variables that causally influence whether A is true or not, is sometimes unreasonably constraining. We hypothesize that speakers may resolve this tension by including in their delibera...

MAGIX: Model Agnostic Globally Interpretable Explanations

Explaining the behavior of a black box machine learning model at the instance level is useful for building trust. However, what is also important is understanding how the model behaves globally. Such an understanding provides insight into both the data on which the model was trained and the generalization power of the rules it learned. We present here an approach that learns rules to explain gl...

Interpretable and Informative Explanations of Outcomes

In this paper, we solve the following data summarization problem: given a multi-dimensional data set augmented with a binary attribute, how can we construct an interpretable and informative summary of the factors affecting the binary attribute in terms of the combinations of values of the dimension attributes? We refer to such summaries as explanation tables. We show the hardness of constructin...

Using Counterfactual Analysis for Providing Historical Explanations in Social Sciences

Counterfactual analysis is concerned with explaining events that have not happened. Counterfactuals are mental experiments through which one can reconstruct hypothetical versions of history in one’s mind; these versions differ from the real history, but provide one with the opportunity to test historical hypotheses against the available evidence. Historicist researchers in...

Journal

Journal title: Lecture Notes in Computer Science

Year: 2021

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-030-86520-7_40